Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

peepopt shiftadd: Only match for sufficiently small constant widths #4448

Merged
merged 1 commit into from
Nov 20, 2024

Conversation

georgerennie
Copy link
Collaborator

This addresses issue #4445. There are two issues from that, one being that casting a constant of arbitrary size to an int can overflow, and the other being that the shiftadd peephole optimization creates logic that can be very large for large shift constants.

This is a simple fix to both that just disables the optimization for constants over a certain bitwidth. I have arbitrarily chosen 24 bits here but am happy to change that or discuss a more fine-grained solution than just disabling the optimization. I suspect most user code will not have offsets outside of this range.

@phsauter
Copy link
Contributor

As mentioned in the issue:
Verilog-2005 specifies in 4.3.1 the following:
The smallest allowed upper limit on the size of a vector (number of wires/bits) is 65536 ($2^{16}$).

So a $2^{24}$ limit is certainly a good general choice.
Since this is only an optimization and it works without it, it would also be legal to select a smaller upper limit, this would be a good idea if we think it could degrade performance (of the circuit or the synthesis process).

In this particular case I could see it degrading the synthesis process in particular during techmap.
Shift operations of the form data >> ... -c are transformed into a padded data instead. This is fine in isolation but then during techmap it seems to first build the full barrel shifter and only then optimize it using the constant input signals (padding of data). With a shift of $2^{24}/24-1$* this would create tens of millions of $_MUX_ cells and then optimize them away.
I don't know how big of an impact this has on the runtime and peak memory.

The 'nice' way to solve this would probably involve mapping shift operations first to $mux instead and then during the simple-mapping of $mux it is possible to check if A==B and if this is the case, skip creating this $_MUX_ cell and instead just connect the wires.
The simpler way is to measure the performance impact of a max-size shift, its currently running.

* something in techmap checks the expression width and limits it to 24-bit apparently, hence a smaller limit applies or we run into an error:

ERROR: Expression width 50331679 exceeds implementation limit of 16777216!

Copy link
Collaborator

@widlarizer widlarizer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since #4455, shift size is sufficiently limited, and this safety limit is safe to implement

@widlarizer widlarizer merged commit 56b80bd into YosysHQ:main Nov 20, 2024
21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants